Image Processing Apparatus, Image Processing Method,
and Recording Medium Recorded with Image Processing 
Program to Process Image Taking into Consideration 
Difference in Image Pickup Condition Using AAM 

This application is based on Japanese Patent Application No. 11-168690
filed in Japan, the contents of which are hereby incorporated by
reference.

BACKGROUND OF THE INVENTION 
Field of the Invention 

The present invention relates to an image processing apparatus, an 
image processing method, and a recording medium recorded with an image 
processing program. More particularly, the present invention relates to an 
image processing apparatus, an image processing method, and a recording 
medium recorded with an image processing program to process an image 
taking into consideration difference in the pickup condition of a face image 
using an AAM (Active Appearance Model).

Description of the Related Art

As a method of synthesizing a natural face image by statistically
analyzing many face images without using a three-dimensional shape,
Cootes et al. of the University of Manchester have proposed a method using
an AAM. The method using an AAM includes the steps of extracting shape
information and texture information from the positions, grey-level values
and the like of landmark points defined for each feature of the face in a
large number of two-dimensional face images, applying principal component
analysis to the extracted shape information and texture information to
obtain an orthonormal coordinate system (image space) corresponding to
changes in shape and facial expression, and altering the parameters along
the respective coordinate axes of the obtained image space to synthesize a
face image. When an AAM is employed in an image recognition method, the
parameters are altered within the obtained image space to synthesize
images, and the parameters of the synthesized image that has the smallest
difference from the face image that is the subject of recognition are
obtained. By measuring the distance between the vector of the obtained
parameters and the landmark vector of a face image that is already
registered, it is determined whether the images match.

Japanese Patent Laying-Open No. 6-168317 discloses an individual 
identification apparatus taking into account inclination of the face. 
According to this individual identification apparatus, the position 
information of the landmark point of a face image is obtained to calculate 
the leftwards and rightwards rotation angle on the basis of the obtained 
position information and depth information of a reference face model. The 
position information of the landmark point is corrected so that the rotation 
angle becomes zero according to the obtained rotation angle and depth 
information. The individual identification process is carried out using the 
corrected landmark points. This individual identification apparatus can 
prevent reduction in the recognition rate of an input face image even if there 
is rotational difference leftwards or rightwards (the direction of turning the 
neck horizontally) between the face image that is registered in advance and 
the input face image. 

The foregoing AAM employs a two-dimensional image, and the shape
information corresponds to the two-dimensional coordinates of the landmark
points defined for each feature of a face. There was a problem that the
extracted shape information could not distinguish differences in the
inclination of the face image in the depth direction, such as whether the
two-dimensional face image corresponds to a face oriented frontwards,
upwards, or downwards. For example, the two-dimensional face image of a
person with a wide forehead will be represented with a small forehead when
the face is inclined upwards. If the inclination of the face image in the
depth direction cannot be distinguished, the shape information extracted
from the two-dimensional face image will erroneously represent the shape of
a person with a small forehead.

The art disclosed in the foregoing Japanese Patent Laying-Open No.
6-168317 can accommodate difference in the rotation angle of the face in the
horizontal direction in image recognition. However, there was a problem
that upward or downward rotation, i.e. inclination in the depth direction,
could not be accommodated.

An identical object is represented in different colors in an image shot in
a dark place and in an image shot in a bright place. To remove this
difference in the illumination condition, the light intensity was
conventionally normalized. In that case, there was a problem that
difference in the color of the skin could not be identified. In other
words, there was a problem that the tendency of the characteristic feature
of a face arising from difference in nationality could not be extracted.

SUMMARY OF THE INVENTION

In view of the foregoing, an object of the present invention is to
provide an image processing apparatus, an image processing method, and a
recording medium recorded with an image processing program that can
eliminate influence of difference in the image pickup condition from the
input image.

Another object of the present invention is to provide an image
processing apparatus, an image processing method, and a recording medium
recorded with an image processing program that can synthesize an image of
an object in which the influence of difference in the image pickup
condition has been eliminated from the input image.

A further object of the present invention is to provide an image
processing apparatus, an image processing method, and a recording medium
recorded with an image processing program that allow an object whose
appearance differs because of different image pickup conditions to be
recognized as the same object.

According to an aspect of the present invention, an image processing 
apparatus includes a landmark amount input unit to input a landmark 
amount of an object included in the input image, an image pickup condition 
input unit to input the image pickup condition of shooting the input image, 
and an image space formation unit to form an image space by applying a 
statistical method on the plurality of landmark amounts input through the 
landmark amount input unit and a plurality of image pickup conditions 
input through the image pickup condition input unit with respect to a 
plurality of object images.

According to the present invention, an image processing apparatus
that can eliminate influence of difference in the image pickup condition from
the input image can be provided.

According to another aspect of the present invention, an image 
processing apparatus includes a storage unit to store an image space 
generated according to the landmark amount of an object included in the 
image and the image pickup condition of shooting the image, a parameter 
input unit to input a parameter of the image space, and an image synthesis 
unit synthesizing an image according to the parameter input through the 
parameter input unit. 

According to the present invention, an image processing apparatus
that can synthesize an image of an object in which the influence of
difference in the image pickup condition has been eliminated from the input
image can be provided.

According to a further aspect of the present invention, an image
processing apparatus includes a first storage unit to store an image space
generated according to a landmark amount of an object included in an image
and an image pickup condition of shooting the image, a parameter
optimization unit automatically extracting a first parameter that minimizes
the difference between a first object included in the input image and an
image synthesized by varying the parameter in the image space, a
second storage unit storing a plurality of second objects respectively in
correspondence with a second parameter in the image space, and a selection
unit comparing the first parameter with the second parameter to select a
desired object from the plurality of second objects.

According to the present invention, an image processing apparatus 
can be provided that allows recognition of an object differing due to different 
image pickup conditions as the same object. 

According to still another aspect of the present invention, an image
processing method includes the steps of entering a landmark amount of an
object image included in an input image, entering an image pickup condition
of shooting the input image, and forming an image space by applying a
statistical method on a plurality of landmark amounts input at the step of
entering a landmark amount and a plurality of image pickup conditions
input at the step of entering an image pickup condition with respect to a
plurality of object images.

According to the present invention, an image processing method can
be provided that allows the influence of difference in the image pickup
condition to be removed from the input image.

According to a still further aspect of the present invention, a
recording medium is recorded with an image processing program for a
computer to execute the steps of entering a landmark amount of an object
included in an input image, entering an image pickup condition of shooting
the input image, and forming an image space by applying a statistical
method on a plurality of landmark amounts input at the step of entering a
landmark amount and a plurality of image pickup conditions input at the
step of entering an image pickup condition with respect to a plurality of
object images.

According to the present invention, a recording medium can be
provided recorded with an image processing program for a computer to
execute an image process that can eliminate influence of difference in the
image pickup condition from an input image.

The foregoing and other objects, features, aspects and advantages of
the present invention will become more apparent from the following detailed
description of the present invention when taken in conjunction with the
accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 is a block diagram showing the hardware structure of an
image processing apparatus according to a first embodiment of the present
invention.

Fig. 2 is a block diagram schematically representing the image space
formation function of an image processing apparatus according to the first
embodiment.

Fig. 3 is a schematic diagram showing a face image and landmark
points.

Fig. 4 represents distribution of a face image oriented frontwards,
upwards and downwards in the X1-X2 coordinates of respective landmark
points.

Fig. 5 represents distribution of respective landmark points with a
Z axis provided perpendicular to the X1-X2 coordinates.

Fig. 6 shows a main axis A derived by the principal component
analysis.

Fig. 7 is a block diagram schematically showing the image synthesis
function and image recognition function of the image processing apparatus
of the first embodiment.

Fig. 8 is a flow chart of the image space formation process carried out
by the image processing apparatus of the first embodiment.

Fig. 9 is a flow chart of the image synthesis process carried out by
the image processing apparatus of the first embodiment.

Fig. 10 is a flow chart of the image recognition process carried out by
the image processing apparatus of the first embodiment.

Fig. 11 is a block diagram schematically showing the image
synthesis function and image recognition function of an image processing
apparatus according to a second embodiment of the present invention.

Fig. 12 is a flow chart of the image synthesis process carried out by
the image processing apparatus of the second embodiment.

Fig. 13 is a flow chart of the image recognition process carried out by
the image processing apparatus of the second embodiment.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

Embodiments of the present invention will be described hereinafter
with reference to the drawings. In the drawings, like or corresponding
components have the same reference characters allotted.

First Embodiment 

Referring to Fig. 1, an image processing apparatus according to a
first embodiment of the present invention includes a control unit 100, an
image input unit 101 to input an image of a person's face, an operation unit
102 for the user of the apparatus to enter data and various instructions, a
storage unit 103 to store a program to be executed by control unit 100 and
information required for control unit 100 to execute the program, an output
unit 104 to output an image, and an external storage device 105.

Control unit 100 is a central processing unit (CPU) to provide overall
control of the image processing apparatus. Image input unit 101 is an
image scanner including a linear CCD sensor to read in a photographic
picture or the like with a face to provide two-dimensional face image data.
It is to be noted that a digital camera or the like can be used to actually shoot
an individual to provide a two-dimensional face image. Also, image input
unit 101 may be an input terminal for connection with an external image
scanner or digital camera.

Storage unit 103 includes a ROM to store a program to be executed
by control unit 100, a RAM to temporarily store variables or the like
required to execute a program with control unit 100, a hard disk to store
various data, and the like.

Output unit 104 is a display to display the image input through
image input unit 101 or an image subjected to image processing. A printer
can be used together with the display.

External storage device 105 is a magneto-optical disk drive or a
digital video disk drive to read in a program recorded in a recording medium
106 to be executed with control unit 100 or two-dimensional face image data.
A synthesized face image subjected to the image synthesis process with
control unit 100 and the image recognition result can be written into
recording medium 106.

Fig. 2 is a function block diagram schematically showing the image
space formation function of the image processing apparatus of the first
embodiment. The image processing apparatus includes a landmark data
input unit 111 to input data of a landmark point of a two-dimensional face
image input through image input unit 101, an inclination amount input unit
113 to input the amount of inclination of the two-dimensional face image in
the depth direction, and an image space formation unit 115 applying
principal component analysis according to the landmark data and
inclination amount input through landmark data input unit 111 and
inclination amount input unit 113 to generate an image space represented
by a basic vector. The basic vector of the image space generated by image
space formation unit 115 is stored in storage unit 103. The face image
input through image input unit 101 and used in generation of the image
space is stored in storage unit 103 together with the parameter in the image
space.

Landmark data input unit 111 receives as landmark data the
coordinates of a landmark point to identify the contour of the face, the eye,
nose or mouth of the face image input through image input unit 101 and the
grey-level value of the texture of the face image. The two-dimensional face
image input through image input unit 101 is shown on the display of output
unit 104. The operator views the face image provided on the display of
output unit 104 and clicks a predetermined position using the mouse of
operation unit 102. The position clicked with the mouse becomes a
landmark point. The coordinates and the grey-level value of the texture of
the face image corresponding to that landmark point are input as landmark
data.

The landmark point will be described in further detail hereinafter.
Fig. 3 schematically shows a face image and the landmark points.
Referring to Fig. 3, a face contour 10, eye contours 30 and 40, a nose contour
50 and a mouth contour 60 are indicated. A landmark point is defined with
respective contours 10, 30, 40, 50 and 60 as a reference. Eleven landmark
points 11-21 are identified on the basis of face contour 10. Four landmark
points 31-34 are identified on the basis of right eye contour 30. Four
landmark points 41-44 are identified on the basis of left eye contour 40.
Five landmark points 51-55 are identified on the basis of nose contour 50.
Five landmark points 61-65 are identified on the basis of mouth contour 60.

Only twenty-nine landmark points are shown in Fig. 3 for the sake of
simplification. It is desirable that landmark points sufficient to give the
feature of a shape are provided. Preferably, approximately 122 landmark
points should be provided.

Inclination amount input unit 113 of Fig. 2 receives the amount of
inclination, in the depth direction, of the face image input through image
input unit 101. The operator views the face image on the display
of output unit 104 to determine whether the face is oriented frontwards,
upwards or downwards, and enters the inclination amount in the depth
direction through the keyboard of operation unit 102. For example, the
inclination amount is set to a when facing upwards, to 0 when facing
frontwards, and to -a when facing downwards (a is a positive number).

Image space formation unit 115 carries out principal component
analysis using a plurality of landmark data input through landmark data
input unit 111 and a plurality of inclination amounts input through
inclination amount input unit 113 with respect to face images input through
image input unit 101. As a result of the principal component analysis, the
obtained orthonormal coordinates are stored in storage unit 103 as the
image space. Also, the image data input through image input unit 101, the
landmark data and inclination amount corresponding to respective image
data, and the parameter in the image space are stored in correspondence in
storage unit 103.
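
The input to this principal component analysis can be pictured as one
vector per reference face image, in which the inclination amount is appended
to the landmark data. The following is a minimal sketch under stated
assumptions: the function name build_sample_vector, the orientation labels
and the value A = 1.0 are hypothetical, and the source does not specify how
the coordinates, grey-level values and inclination amount are ordered.

```python
from typing import List, Sequence, Tuple

# Hypothetical magnitude of the inclination amount; the text only states
# that "a" is a positive number.
A = 1.0

# Orientation labels are illustrative names, not taken from the source.
INCLINATION = {"up": A, "front": 0.0, "down": -A}


def build_sample_vector(
    landmarks: Sequence[Tuple[float, float]],
    grey_levels: Sequence[float],
    orientation: str,
) -> List[float]:
    """Flatten one reference face into [x1, y1, ..., g1, ..., inclination]."""
    vector: List[float] = []
    for x, y in landmarks:
        vector.extend((float(x), float(y)))
    vector.extend(float(g) for g in grey_levels)
    # The inclination amount becomes one more component of the sample.
    vector.append(INCLINATION[orientation])
    return vector


# Two toy reference faces with two landmark points each.
samples = [
    build_sample_vector([(10, 20), (30, 20)], [128, 130], "front"),
    build_sample_vector([(10, 18), (30, 18)], [120, 125], "up"),
]
```

Each row of samples would then be one observation for the principal
component analysis carried out by image space formation unit 115.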

The image processing apparatus of the present embodiment carries
out the principal component analysis taking into consideration the
inclination amount, differing from the principal component analysis carried
out in an AAM. For the sake of simplifying the description, X1 and X2 are
respective components of the vectors of the landmark point data aligned in
the X and Y coordinates, S1 is a landmark point of a face image oriented
frontwards, S2 is a landmark point of a face image oriented upwards, and S3
is a landmark point of a face image oriented downwards.

Fig. 4 shows a distribution of respective landmark points of face
images oriented frontward, upward, and downward along the X1-X2
coordinates. The circle located at the center represents the distribution of
landmark points S1 of a face image oriented frontwards. The overlapping
circles at respective sides represent the distribution of landmark points S2 of
a face image oriented upwards and the distribution of landmark points S3 of
a face image oriented downwards. Here, the presence of correlation
between components X1 and X2 is observed. However, it is not definite
whether that correlation arises from inclination of the face or from
difference in the shape. In other words, determination cannot be made
whether this difference in distribution is caused by difference in a face
image oriented frontwards, upwards or downwards, or difference in the face
image itself oriented frontwards. By providing another Z axis,
determination can be made between distribution of landmark points of a
face image oriented frontwards and distribution of landmark points of a face
image oriented upwards or downwards.

Fig. 5 represents distribution of landmark points with a Z axis
newly provided perpendicular to the X1-X2 coordinates. Referring to Fig. 5,
distinction is made among the distribution of landmark points S1 of a face
image oriented frontwards, distribution of landmark points S2 of a face
image oriented upwards, and distribution of landmark points S3 of a face
image oriented downwards. This indicates that the correlation between X1
and X2 is caused by difference in the orientation of frontwards, upwards or
downwards. It is appreciated that the respective distributions of landmark
points S1 of a face image oriented frontwards, landmark points S2 of a face
image oriented upwards and landmark points S3 of a face image oriented
downwards, taken within their respective X1-X2 coordinate planes, show no
correlation between X1 and X2.

The result of applying principal component analysis on the
landmark points of this three-dimensional space (X1, X2, Z) is shown in Fig. 6.
Referring to Fig. 6, a main axis A indicating the inclination of the face image
in the depth direction can be derived from the principal component analysis.
Therefore, determination can be made that the correlation of the
distribution of X1 and X2 has originated from face inclination. Thus, an
axis (H) corresponding to change in a face oriented frontwards can be
derived independent of the axis (A) corresponding to change in the face
inclination. In Fig. 6, the H axis represents the plane of X1-X2.
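
The situation of Figs. 4 to 6 can be reproduced numerically with synthetic
data. The following is a sketch under stated assumptions, not the procedure
of the embodiment: it uses NumPy, the cluster spread (0.1), the displacement
per unit inclination (0.8, 0.8) and a = 1.0 are all hypothetical values, and
principal component analysis is implemented as an eigendecomposition of the
sample covariance matrix.

```python
import numpy as np


def principal_axes(samples: np.ndarray):
    """Return (mean, eigenvalues, eigenvectors) sorted by decreasing variance."""
    mean = samples.mean(axis=0)
    centered = samples - mean
    covariance = centered.T @ centered / (len(samples) - 1)
    eigenvalues, eigenvectors = np.linalg.eigh(covariance)  # ascending order
    order = np.argsort(eigenvalues)[::-1]
    return mean, eigenvalues[order], eigenvectors[:, order]


rng = np.random.default_rng(0)
a = 1.0                        # hypothetical inclination amount
shift = np.array([0.8, 0.8])   # hypothetical (X1, X2) displacement per unit tilt

clusters = []
for z in (a, 0.0, -a):         # S2 (upwards), S1 (frontwards), S3 (downwards)
    # Within one orientation, X1 and X2 are drawn without correlation.
    x1_x2 = rng.normal(scale=0.1, size=(200, 2)) + z * shift
    clusters.append(np.column_stack([x1_x2, np.full(200, z)]))
samples = np.vstack(clusters)  # shape (600, 3): columns X1, X2, Z

mean, eigenvalues, axes = principal_axes(samples)
main_axis_a = axes[:, 0]       # direction of largest variance
```

With these values the first principal axis is nearly parallel to
(0.8, 0.8, 1), so the apparent X1-X2 correlation is absorbed by a single axis
that also carries the Z (inclination) component, which is the role the text
assigns to main axis A.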

By altering the component along the obtained main axis A, the face
image can be altered corresponding to the inclination of the face in the depth
direction. Also, by adjusting the component along an axis within the
orthogonal complement of the subspace spanned by main axis A, the face
image can be deformed independent of the change caused by inclination in
the depth direction. Therefore, an image can be synthesized effectively.
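
The separation described above amounts to splitting a sample into a
component along main axis A and a residual in its orthogonal complement.
The following is a minimal sketch assuming that the axis is given as a unit
vector; the function names and the numeric values are hypothetical.

```python
from typing import List, Sequence, Tuple


def dot(u: Sequence[float], v: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(u, v))


def split_along_axis(
    vector: Sequence[float], unit_axis: Sequence[float]
) -> Tuple[float, List[float]]:
    """Return (coefficient along unit_axis, residual orthogonal to unit_axis)."""
    coefficient = dot(vector, unit_axis)
    residual = [x - coefficient * a for x, a in zip(vector, unit_axis)]
    return coefficient, residual


def set_inclination(
    vector: Sequence[float], unit_axis: Sequence[float], new_coefficient: float
) -> List[float]:
    """Change only the component along unit_axis; keep the residual unchanged."""
    _, residual = split_along_axis(vector, unit_axis)
    return [r + new_coefficient * a for r, a in zip(residual, unit_axis)]


axis_a = [0.6, 0.0, 0.8]       # hypothetical unit vector standing in for axis A
face = [1.0, 2.0, 3.0]         # hypothetical sample in (X1, X2, Z)
coefficient, residual = split_along_axis(face, axis_a)
frontal = set_inclination(face, axis_a, 0.0)  # inclination component removed
```

Setting the coefficient to zero leaves only the residual, that is, the part
of the sample that does not depend on the inclination axis.
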

The image synthesis function and image recognition function will be
described hereinafter. Fig. 7 is a block diagram schematically showing
functions of image synthesis and image recognition of the image processing
apparatus of the present embodiment. Referring to Fig. 7, the image
processing apparatus includes a parameter optimization unit 121 obtaining
a landmark parameter in the image space of the input face image, an image
synthesis unit 117 to synthesize an image at the image space based on the
obtained landmark parameter, and an image recognition unit 119 to select a
face image of an individual identical to the input face image out of a
plurality of face images prestored in storage unit 103.

The face image input through image input unit 101 is provided to
parameter optimization unit 121. The image space (basic vector) generated
at image space formation unit 115, the face image used in generating the
image space, and the landmark parameter of that image space are stored in
storage unit 103.

The image synthesized at image synthesis unit 117 is provided from
output unit 104. The result recognized at image recognition unit 119 is
provided from output unit 104. "Recognized result" is a selected image or
individual information such as the name corresponding to that image when
an image is selected at image recognition unit 119, and information of "no
relevance" when an image was not selected.

Parameter optimization unit 121 compares the face image input
through image input unit 101 with the image synthesized using
provisionally set landmark parameters at the image space stored in storage
unit 103. The provisionally set landmark parameter is varied until the
difference between the image synthesized at the image space and the input
face image becomes smallest. The landmark parameter corresponding to
the smallest difference is obtained. Accordingly, the landmark parameter
for the face image input through image input unit 101 is obtained for each
coordinate axis of the image space.
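
The search performed by parameter optimization unit 121 can be sketched as
follows. This is an illustration under stated assumptions, not the
optimization procedure of the embodiment, which the text does not specify:
the synthesized image is modeled as the mean plus a weighted sum of basic
vectors, the difference is a sum of squared differences, and the search is a
simple coordinate-wise search with a shrinking step. All names and values
are hypothetical.

```python
from typing import List, Sequence, Tuple


def synthesize(
    mean: Sequence[float], basis: Sequence[Sequence[float]], params: Sequence[float]
) -> List[float]:
    """Synthesize a sample as mean + sum_i params[i] * basis[i]."""
    return [
        m + sum(p * axis[i] for p, axis in zip(params, basis))
        for i, m in enumerate(mean)
    ]


def squared_difference(u: Sequence[float], v: Sequence[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(u, v))


def optimize_parameters(
    target: Sequence[float],
    mean: Sequence[float],
    basis: Sequence[Sequence[float]],
    step: float = 1.0,
    min_step: float = 1e-6,
) -> Tuple[List[float], float]:
    """Vary provisional parameters until the difference stops decreasing."""
    params = [0.0] * len(basis)            # provisionally set parameters
    best = squared_difference(synthesize(mean, basis, params), target)
    while step > min_step:
        improved = False
        for i in range(len(params)):
            for delta in (step, -step):
                trial = list(params)
                trial[i] += delta
                error = squared_difference(synthesize(mean, basis, trial), target)
                if error < best:           # keep the change only if it helps
                    params, best, improved = trial, error, True
        if not improved:
            step /= 2.0                    # refine the search
    return params, best


mean = [0.0, 0.0, 0.0]
basis = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
target = synthesize(mean, basis, [1.5, -0.5])
found, error = optimize_parameters(target, mean, basis)
```

In this toy case the search recovers the parameters used to build the
target, one value per coordinate axis of the image space.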

Image synthesis unit 117 alters the landmark parameter obtained at
parameter optimization unit 121 to synthesize an image at the image space
stored in storage unit 103. Accordingly, a face image oriented frontwards
or oriented upwards/downwards can be synthesized and output even in the
case where the face image input through image input unit 101 is inclined in
the depth direction.

Image recognition unit 119 selects, out of the face images stored in
storage unit 103, a face image for which the distance between the vector
whose components are the landmark parameters obtained at parameter
optimization unit 121 and the vector whose components are the landmark
parameters of the stored face image is smaller than a predetermined value.
As this vector distance, the Euclidean distance can be employed, as can the
Mahalanobis distance, which takes data variance into consideration.
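
The two distances named above can be sketched as follows. The Mahalanobis
variant shown here assumes a diagonal covariance matrix, that is, one
variance per coordinate axis of the image space; this is an assumption of the
sketch (parameters along principal axes are uncorrelated over the training
data), not a statement from the source. The function names are hypothetical.

```python
import math
from typing import Sequence


def euclidean_distance(p: Sequence[float], q: Sequence[float]) -> float:
    """Plain Euclidean distance between two landmark parameter vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def mahalanobis_distance_diagonal(
    p: Sequence[float], q: Sequence[float], variances: Sequence[float]
) -> float:
    """Mahalanobis distance for a diagonal covariance (one variance per axis)."""
    return math.sqrt(
        sum((a - b) ** 2 / v for a, b, v in zip(p, q, variances))
    )


d_euclid = euclidean_distance([0.0, 0.0], [3.0, 4.0])
d_mahal = mahalanobis_distance_diagonal([0.0, 0.0], [3.0, 4.0], [9.0, 16.0])
```

Dividing by the variance makes a difference along a high-variance axis count
for less than the same difference along a low-variance axis.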

The face image stored in storage unit 103 is the face image used in
forming the image space at image space formation unit 115. The face
image may also be a face image input after configuring the image space at
image space formation unit 115.

The image space formation process carried out by the image
processing apparatus of the present embodiment will be described with
reference to the flow chart of Fig. 8. A reference face image is input
through image input unit 101 (S01). Here, a reference face image refers to
the face image used in forming an image space at image space formation
unit 115.

The reference face image input through image input unit 101 is 
normalized (S02). Normalization means that the size of the input reference 
face image is set to fit a predetermined reference. More specifically, the 
distance between the two eyes of a face image is set to conform to a 
predetermined value. 
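
The normalization of step S02 can be sketched as a uniform scaling of the
landmark coordinates. This is a minimal sketch: the function name, the
target distance of 60.0 and the choice of scaling about the coordinate origin
are assumptions; the source only states that the distance between the two
eyes is set to a predetermined value.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def normalize_scale(
    points: Sequence[Point],
    left_eye: Point,
    right_eye: Point,
    target_eye_distance: float = 60.0,   # hypothetical predetermined value
) -> List[Point]:
    """Scale all points so that the inter-eye distance equals the target."""
    eye_distance = math.dist(left_eye, right_eye)
    if eye_distance == 0.0:
        raise ValueError("eye positions coincide; cannot normalize")
    scale = target_eye_distance / eye_distance
    return [(x * scale, y * scale) for x, y in points]


landmarks = [(10.0, 20.0), (40.0, 20.0), (25.0, 50.0)]
normalized = normalize_scale(landmarks, landmarks[0], landmarks[1])
```

After scaling, the first two points (used here as the eye positions) are
exactly the target distance apart, so landmark vectors from images of
different sizes become comparable.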

The coordinates of the landmark point and the grey-level value of
the texture for the reference face image input through image input unit 101
are input by landmark data input unit 111 (S03). Then, the inclination
amount of the input reference face image in the depth direction is input by
inclination amount input unit 113 (S04).

At step S05, determination is made whether there is another
reference face image to be input. This determination is made by the signal
input by the user through operation unit 102. In the case where there is
another reference face image, the foregoing process of steps S01-S04 is
carried out for the newly input reference face image. When there is no
other reference face image to be input, control proceeds to step S06. By
carrying out the process of steps S01-S04 on a plurality of reference face
images, the landmark data and inclination amount are input for each of the
plurality of reference face images.

At step S06, principal component analysis is carried out for all the
landmark data and inclination amounts of the reference face images input
through image input unit 101. The principal component analysis is a well
known statistical method, so that description thereof will not be provided
here. Upon extracting a principal component at step S06, control proceeds to
step S07 to determine whether the total of the contribution rate is greater
than threshold value T or not. The principal component analysis of step
S06 is repeatedly carried out until the total of the contribution rate becomes
larger than threshold value T.
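
The stopping rule of steps S06 and S07 can be sketched as follows, assuming
that the contribution rate of a principal component is its eigenvalue divided
by the sum of all eigenvalues. That definition is the usual one but is not
spelled out in the source; the function name and the numbers are hypothetical.

```python
from typing import Sequence


def components_needed(eigenvalues: Sequence[float], threshold: float) -> int:
    """Smallest number of components whose total contribution rate exceeds threshold."""
    total = sum(eigenvalues)
    cumulative = 0.0
    # Extract components in order of decreasing variance (step S06).
    for count, value in enumerate(sorted(eigenvalues, reverse=True), start=1):
        cumulative += value
        if cumulative / total > threshold:   # step S07
            return count
    return len(eigenvalues)                  # threshold never exceeded


eigenvalues = [5.0, 3.0, 1.0, 1.0]           # toy variances along each axis
k_low = components_needed(eigenvalues, 0.75)
k_high = components_needed(eigenvalues, 0.85)
```

A larger threshold T keeps more axes in the image space, at the cost of a
longer landmark parameter vector for every stored face image.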

When the total of the contribution rate becomes larger than threshold
value T, control proceeds to step S08 to store the basic vectors representing
the principal components in storage unit 103. An image space is formed by
the basic vectors stored in storage unit 103.

At step S09, the landmark parameter in the image space obtained at
step S08 is derived for the reference face image input through step S01.
The obtained landmark parameter is stored in storage unit 103 in
correspondence with the reference face image.
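
One plausible reading of step S09 is a projection of each reference sample
onto the stored basic vectors. The sketch below makes that assumption
explicit: it presumes orthonormal basic vectors and a stored mean vector,
neither of which the source states in this passage, and all names and values
are hypothetical.

```python
from typing import Dict, List, Sequence


def project(
    sample: Sequence[float], mean: Sequence[float], basis: Sequence[Sequence[float]]
) -> List[float]:
    """Return the landmark parameters: one coefficient per basic vector."""
    centered = [s - m for s, m in zip(sample, mean)]
    return [sum(c * b for c, b in zip(centered, axis)) for axis in basis]


mean = [1.0, 1.0, 1.0]
basis = [[1.0, 0.0, 0.0], [0.0, 0.6, 0.8]]   # orthonormal toy basic vectors

reference_samples = {
    "reference_1": [3.0, 2.0, 1.0],
    "reference_2": [1.0, 1.0, 2.0],
}

# Database of landmark parameters stored in correspondence with each image.
database: Dict[str, List[float]] = {
    name: project(sample, mean, basis)
    for name, sample in reference_samples.items()
}
```

The resulting dictionary plays the role of the stored correspondence between
each reference face image and its landmark parameter.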

Thus, following formation of an image space, a database of face
images is generated including the reference face images used in producing
the image space and their landmark parameters. The reference face images
may be face images of completely different individuals, or face images of
the same individual. In the case where the face images correspond to those
of the same individual, the landmark parameters representing the face
images of that individual form a cluster in the image space, spread
according to difference in the expression.

The image synthesis process carried out by the image processing
apparatus of the present invention will be described with reference to the
flow chart of Fig. 9. A processing face image is input through image input
unit 101 (S11). Here, a processing face image refers to a face image that is
to be subjected to image synthesis. Here, image synthesis is carried out by
altering the face image of the individual represented in the processing face
image to synthesize face images of various expressions.

At step S12, the input processing face image is normalized. The 
foregoing description applies to normalization. 

At step S13, the landmark parameter of the processing face image is
optimized. Optimization of the landmark parameter is carried out by
comparing the image synthesized at the image space using a provisionally
set landmark parameter with the processing face image and altering the
landmark parameter in a direction where the difference between the images
becomes smaller. The optimized landmark parameter is the landmark
parameter of the synthesized image corresponding to the smallest difference
between the processing face image and the image synthesized at the image
space. Accordingly, the landmark parameter of the processing face image
input through step S11 in the image space is obtained. By altering the
landmark parameter obtained at step S13, a face image of a different
expression is synthesized in the image space (S14).

The image synthesis process carried out by the image processing
apparatus of the present embodiment is performed on the basis of the input
processing face image. Alternatively, the landmark parameter can be
directly input through operation unit 102 for each coordinate axis of the
image space formed by image space formation unit 115. In this case, the
process of steps S11-S13 can be omitted.

The image recognition process carried out by the image processing
apparatus of the present embodiment will be described with reference to the
flow chart of Fig. 10. The process of steps S21-S23 is identical to the
process of steps S11-S13 in the image synthesis process of Fig. 9.
Therefore, description thereof will not be repeated. Upon obtaining, at
step S23, the landmark parameter in the image space of the processing face
image input through image input unit 101, the vector with
the landmark parameter of the processing face image as the component is
compared with the vector with the landmark parameter of the reference face
image stored in storage unit 103 as the component (S24). At step S25,
determination is made whether the distance between the respective vectors is
smaller than a predetermined value L. When smaller than predetermined
value L, control proceeds to step S26, otherwise to step S27.

It can be said that the processing face image resembles the reference 
face image more as the distance between the vectors becomes smaller. 
Therefore, determination can be made that the processing face image and 
the reference face image correspond to the same individual when the
distance between the vector of the processing face image and the vector of
the reference face image is smaller than threshold value L.

At step S26, the reference face image compared in step S25 is
selected and provided to output unit 104. Then, the image recognition
process ends.

At step S27, determination is made whether there is another
reference face image to be subjected to comparison in storage unit 103.
When there is another reference face image in storage unit 103, control
proceeds to step S24. When there is no more reference face image, the image
recognition process ends.
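
The loop of steps S24 to S27 can be sketched as follows. This is an
illustration under stated assumptions, not the implementation of the
embodiment: the distance is taken to be Euclidean (the description only
says "distance"), and the names recognize, references and threshold are
hypothetical.

```python
import math


def recognize(query_params, references, threshold):
    """Return the label of the first reference face image whose landmark
    parameter vector lies within `threshold` of the query, else None."""
    for label, ref_params in references:           # S27: take the next reference
        distance = math.sqrt(sum((q - r) ** 2      # S24: compare the two vectors
                                 for q, r in zip(query_params, ref_params)))
        if distance < threshold:                   # S25: smaller than value L?
            return label                           # S26: select and output
    return None                                    # no reference image is left


# Hypothetical stored reference parameters and threshold value L = 1.0.
references = [("person_a", [0.0, 0.0]), ("person_b", [3.0, 4.0])]
match = recognize([3.0, 4.5], references, threshold=1.0)
no_match = recognize([10.0, 10.0], references, threshold=1.0)
```

As in the flow chart, the first reference face image found within the
threshold is selected; the sketch does not search for the closest one.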

Here, the reference face image is the face image used in forming the 
image space of image space formation unit 115. The reference face image 
may be a face image input after the image space has been produced at image 
space formation unit 115. 

According to the image processing apparatus of the present
embodiment, principal component analysis is carried out with the
inclination amount indicating inclination of the face image in the depth
direction added to the landmark point data of the input face image to form
an image space. Therefore, a face image can be synthesized corresponding
to the inclination in the depth direction at the image space. Also, by
adjusting the component along an axis in the orthogonal complement
of the subspace represented by the main axis indicating
inclination of the face image in the depth direction, the face can be deformed
independent of the change due to inclination in the depth direction to
synthesize an image. As a result, an image can be synthesized effectively.
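
The formation of the image space and a deformation independent of the
inclination can be sketched as follows, assuming NumPy is available. The
sketch makes several assumptions that are not stated in the embodiment:
principal component analysis is computed by singular value decomposition,
the inclination amount is appended unscaled as one extra vector element,
the main axis indicating inclination is taken to be the axis with the
largest loading on that element, and the training data are synthetic.
All names are illustrative.

```python
import numpy as np


def form_image_space(landmark_data, inclinations, num_axes):
    """Form an image space by PCA over landmark vectors with the
    inclination amount appended as one extra element per face image."""
    augmented = np.column_stack([landmark_data, inclinations])
    mean = augmented.mean(axis=0)
    # Rows of vt are orthonormal principal axes, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(augmented - mean, full_matrices=False)
    return mean, vt[:num_axes]


# Hypothetical training set: 50 face images, 4 landmark values each.
rng = np.random.default_rng(0)
inclinations = rng.uniform(-30.0, 30.0, size=50)
shape_variation = rng.normal(size=50)
landmark_data = np.column_stack([
    0.10 * inclinations,
    -0.05 * inclinations + shape_variation,
    shape_variation,
    rng.normal(size=50),
])
mean, axes = form_image_space(landmark_data, inclinations, num_axes=3)

# Assumption: the axis with the largest loading on the inclination element
# is treated as the main axis indicating inclination.
tilt_axis = int(np.argmax(np.abs(axes[:, -1])))
other_axis = (tilt_axis + 1) % len(axes)

# Parameters of one face image, then a deformation along a different axis.
sample = np.append(landmark_data[0], inclinations[0])
params = axes @ (sample - mean)
deformed_params = params.copy()
deformed_params[other_axis] += 2.0
deformed = mean + deformed_params @ axes

# Parameter along the inclination axis after the deformation.
tilt_after = axes[tilt_axis] @ (deformed - mean)
```

Because the axes are orthonormal, the parameter along the inclination axis
(tilt_after) equals its value before the deformation, which is the sense in
which the deformation is independent of the inclination in this sketch.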

Since the similarity with the face image stored in the storage unit is
determined taking into account inclination of the input face image in the
depth direction, distinction can be made between a variation mode of the
same individual caused by difference in inclination in the depth direction
and a variation mode due to difference in the shape of a different
individual facing frontwards. Accordingly, the possibility of erroneous
recognition due to difference in the inclination of the face in the depth
direction can be reduced. As a result, the accuracy in recognition can be
improved.

In the present embodiment, a grey-level value is used as texture
information. Color data can also be used.

In the present embodiment, landmark data corresponds to the 
coordinates of a landmark point and the grey-level value of the texture of the 
face image. However, image space formation, image synthesis and image 
recognition can be carried out using only the coordinates of the landmark 
point, or only the texture of a face image. 

In the present embodiment, an image space is produced by carrying
out principal component analysis with the inclination amount of an object in
the depth direction as the image pickup condition. Alternatively, the
orientation of an object in the left or right direction (the
direction of the face turning horizontally), or the occupying ratio of the object
in the input image can be taken as the image pickup condition. In the case
where the horizontal orientation of the object is taken as the image pickup
condition, an image of an object oriented frontwards can be synthesized
based on an image obtained by shooting an object from the side or obliquely
from the side, not from the front. When the occupying ratio of the object in
the image is to be taken as the image pickup condition, an image can be
synthesized or recognized taking into consideration whether the actual face
of the object is large or not. The image pickup conditions can be used
singly or in combination.

Furthermore, the lighting condition or color temperature can be
added as the image pickup condition. For example, in the case where an
image space is formed with the brightness of the illumination taken as the
image pickup condition, the image space can be formed taking into account
the skin color of the object. More specifically, principal component analysis
is carried out with the bright or dark level of the illumination as the image
pickup condition to produce an image space. Accordingly, although the object
in an image obtained by shooting under insufficient illumination will be
reduced in density, an image of an object with
light skin color can still be synthesized. Also, an image can be synthesized
using an axis that has correlation with the skin color.

Furthermore, the input image can be subjected to Fourier transform
or subjected to wavelet transform to obtain wavelet coefficients so as to be
employed as the image pickup condition.

Second Embodiment 

An image processing apparatus according to a second embodiment of 
the present invention is directed to improvement of the parameter 
optimization unit of the image processing apparatus of the first embodiment. 
Components corresponding to those of the image processing apparatus of the 
first embodiment will not be repeatedly described. 

Fig. 11 is a block diagram schematically showing the function of
image synthesis and image recognition of the image processing apparatus of
the second embodiment. Referring to Fig. 11, an image processing
apparatus of the second embodiment includes a parameter optimization unit
121A obtaining a landmark parameter of the input face image at an image
space, an image synthesis unit 117 to synthesize an image at the image
space based on the obtained landmark parameter, and an image recognition
unit 119 to select a face image identical to the input face image out of a
plurality of face images prestored in storage unit 103.

Landmark data input unit 122 enters the coordinates of the 
landmark point of the face image input through image input unit 101 and 
the grey-level value of the texture of the face image as landmark data. 
Inclination amount input unit 123 enters an inclination amount of the face 
image input through image input unit 101 in the depth direction. 

A projection unit 124 projects the landmark vector represented by
the landmark data input through landmark data input unit 122 and the
inclination amount input through inclination amount input unit 123 onto 
the coordinate axes of the image space stored in storage unit 103. A 
landmark parameter is obtained for each coordinate axis by the projection 
onto the coordinate axis of the image space. The obtained landmark 
parameter is provided to image synthesis unit 117 or image recognition unit 
119. 
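
The projection can be sketched as follows. The sketch assumes that the
stored image space consists of a mean vector and orthonormal coordinate
axes, and that the mean is subtracted before projection, as is usual for
principal component analysis; the description does not state this. The
names project, mean_vector and axes are hypothetical.

```python
def project(landmark_data, inclination, mean_vector, axes):
    """Project the landmark vector (landmark data plus inclination amount)
    onto each coordinate axis, giving one landmark parameter per axis."""
    vector = list(landmark_data) + [inclination]
    centered = [v - m for v, m in zip(vector, mean_vector)]
    return [sum(a * c for a, c in zip(axis, centered)) for axis in axes]


# Hypothetical stored image space: mean vector and two coordinate axes.
mean_vector = [1.0, 1.0, 10.0]
axes = [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]

# Two landmark values and an inclination amount of 12.0 for the input image.
landmark_params = project([3.0, 5.0], 12.0, mean_vector, axes)
```

Each landmark parameter is obtained with a single inner product per
coordinate axis, so no iterative search over the image space is involved
in this sketch.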

Fig. 12 is a flow chart of the image synthesis process carried out by 
the image processing apparatus of the second embodiment. A processing
face image is input through image input unit 101 (S31). The input
processing face image is normalized (S32).

Then, the coordinates of the landmark point and the grey-level value of the
texture of the input processing face image are input (S33). The
inclination amount of the processing face image in the depth direction is
input (S34).

At step S35, the landmark vector represented by the input landmark
data and inclination amount is projected onto the image space stored in
storage unit 103. As a result, the landmark parameter of the processing
face image input at step S31 in the image space is obtained. By altering the
obtained landmark parameter, a face image of a different expression in the
image space is synthesized (S36).

Fig. 13 is a flow chart of the image recognition process carried out by 
the image processing apparatus of the second embodiment. The process of 
steps S41-S45 is identical to the process of steps S31-S35 of the image 
synthesis process of Fig. 12. Therefore, description thereof will not be 
repeated. At step S45, a landmark parameter of the processing face image
input through image input unit 101 in the image space is obtained. The
vector with the landmark parameter of the processing face image as the
component is compared with the vector with the landmark parameter of the
reference face image stored in storage unit 103 as the component (S46). At
step S47, determination is made whether the distance between the respective
vectors is smaller than a predetermined value L. When the distance is
smaller than predetermined value L, control proceeds to step S48, otherwise
to step S49.

Determination is made that the processing face image and the reference face
image correspond to the same individual when the distance between the
vector of the processing face image and the vector of the reference face image
is smaller than threshold value L.

At step S48, the reference face image compared at step S46 is 
selected and provided to output unit 104. Then, the image recognition 
process ends. 

At step S49, determination is made whether another reference face
image to be subjected to comparison is stored in storage unit 103 or not. 
When there is another reference face image stored in storage unit 103, 
control proceeds to step S46. In the case where there is no other reference 
face image, the image recognition process ends. 

In the first and second embodiments, input of the inclination amount
is carried out manually. Alternatively, measurement means such as a laser
sensor or ultrasonic sensor can be provided to enter the landmark point as
three-dimensional data. In this case, it is not necessary to input the
inclination amount since the landmark data corresponds to three-dimensional
data. An advantage similar to that described in the foregoing can be obtained.

The process of the image processing apparatus described with
reference to the flow charts of Figs. 7, 8, 9, 11 and 12 is applicable to an
image processing method or a recording medium recorded with an image
processing program executing the same process.

Although the present invention has been described and illustrated in
detail, it is clearly understood that the same is by way of illustration and
example only and is not to be taken by way of limitation, the spirit and scope
of the present invention being limited only by the terms of the appended
claims.

WHAT IS CLAIMED IS:



1. An image processing apparatus comprising: 

a landmark amount input unit to input a landmark amount of an 
object image included in an input image, 

an image pickup condition input unit to input an image pickup 
condition of shooting said input image, and 

an image space formation unit to form an image space by applying a 
statistical method on a plurality of said landmark amounts input through 
said landmark amount input unit and a plurality of image pickup conditions 
input through said image pickup condition input unit with respect to a 
plurality of object images. 

2. The image processing apparatus according to claim 1, wherein 
said plurality of landmark amounts input through said landmark amount 
input unit include a plurality of coordinate values to identify a shape of said 
object image. 

3. The image processing apparatus according to claim 1, wherein 
said plurality of landmark amounts input through said landmark amount 
input unit include a plurality of grey-level values of texture of said object
image. 

4. The image processing apparatus according to claim 1, wherein 
said image pickup condition input through said image pickup condition 
input unit includes brightness of illumination during shooting. 

5. The image processing apparatus according to claim 1, wherein 
said image pickup condition input through said image pickup condition 
input unit includes inclination of said object image included in said input 
image in a depth direction. 

6. An image processing apparatus comprising:

a storage unit to store an image space generated according to a
landmark amount of an object image included in an image and an image
pickup condition of shooting said image,

a parameter input unit to input a parameter at said image space,
and

an image synthesis unit to synthesize an image according to the 
parameter input through said parameter input unit. 

7. The image processing apparatus according to claim 6, wherein 
said parameter input unit includes a parameter optimization unit to 
automatically extract a parameter whose difference between an input image 
and a synthesized image obtained by moving the parameter in said image 
space becomes smallest. 

8. The image processing apparatus according to claim 6, wherein 
said parameter input unit includes a projection unit to project said 
landmark amount input through said landmark amount input unit and said 
image pickup condition input through said image pickup condition input 
unit onto said image space to obtain a parameter. 

9. An image processing apparatus comprising: 

a first storage unit to store an image space generated according to a 
landmark amount of an object image included in an image and an image 
pickup condition of shooting said image, 

a parameter optimization unit to automatically extract a first 
parameter whose difference between a first object image included in said 
input image and a synthesized image obtained by moving a parameter in 
said image space becomes smallest, 

a second storage unit to store a plurality of second object images 
respectively in correspondence with a second parameter in said image space, 
and 

a select unit to compare said first parameter with said second
parameter to select a desired object image out of said plurality of second
object images.

10. An image processing method comprising the steps of:

entering a landmark amount of an object image included in an input
image,

entering an image pickup condition of shooting said input image,
and

forming an image space by applying a statistical method on a
plurality of said landmark amounts input at said step of entering a
landmark amount and a plurality of said image pickup conditions input at
said step of entering an image pickup condition with respect to a plurality of
object images.

11. A recording medium recorded with an image processing 
program for a computer to execute the steps of: 

entering a landmark amount of an object image included in an input
image,

entering an image pickup condition of shooting said input image,
and

forming an image space by applying a statistical method on a
plurality of said landmark amounts input at said step of entering a
landmark amount and a plurality of said image pickup conditions input at
said step of entering an image pickup condition with respect to a plurality of
object images.

ABSTRACT OF THE DISCLOSURE

In order to eliminate influence due to difference in the image pickup
condition from an input image, an image processing apparatus includes a
landmark point data input unit to input coordinates of a landmark point of a
face image input through an image input unit and the grey-level value of
texture of a face image, an inclination amount input unit to input an
inclination amount of a face image in the depth direction, and an image
space formation unit forming an image space by carrying out principal
component analysis on a plurality of landmark point data and a plurality of
inclination amounts with respect to a plurality of the input face images.